KVM: Fix agents dont reconnect post maintenance#3239
KVM: Fix agents dont reconnect post maintenance#3239DaanHoogland merged 6 commits intoapache:4.11from
Conversation
yadvr
left a comment
There was a problem hiding this comment.
LGTM, but some changes requested.
|
I'll wait @nvazquez to address comments and will trigger testing on this. |
|
Thanks @rhtyd @borisstoyanov, comments addressed and re-tested functionalities. |
|
@blueorangutan package |
|
@nvazquez a Jenkins job has been kicked to build packages. I'll keep you posted as I make progress. |
|
Packaging result: ✔centos6 ✔centos7 ✔debian. JID-2680 |
|
@blueorangutan test |
|
@nvazquez a Trillian-Jenkins test job (centos7 mgmt + kvm-centos7) has been kicked to run smoke tests |
|
@blueorangutan test |
|
@borisstoyanov a Trillian-Jenkins test job (centos7 mgmt + kvm-centos7) has been kicked to run smoke tests |
|
Trillian test result (tid-3480)
|
|
@blueorangutan package |
|
@nvazquez a Jenkins job has been kicked to build packages. I'll keep you posted as I make progress. |
|
Packaging result: ✔centos6 ✔centos7 ✔debian. JID-2719 |
|
@blueorangutan test |
|
@nvazquez a Trillian-Jenkins test job (centos7 mgmt + kvm-centos7) has been kicked to run smoke tests |
|
Trillian test result (tid-3529)
|
|
@blueorangutan package |
|
@rhtyd a Jenkins job has been kicked to build packages. I'll keep you posted as I make progress. |
|
Packaging result: ✔centos6 ✔centos7 ✔debian. JID-2725 |
|
@blueorangutan test |
|
@rhtyd a Trillian-Jenkins test job (centos7 mgmt + kvm-centos7) has been kicked to run smoke tests |
|
Trillian test result (tid-3542)
|
borisstoyanov
left a comment
There was a problem hiding this comment.
LGTM, manual testing has passed with the automated tests as well
|
I see two approvals and a perfectly passed test suite. Is this still WIP? |
|
ping @nvazquez |
|
Thanks @DaanHoogland @borisstoyanov, this feature is completed |
| } | ||
| try { | ||
| SSHCmdHelper.SSHCmdResult result = SSHCmdHelper.sshExecuteCmdOneShot( | ||
| connection, "service cloudstack-agent restart"); |
There was a problem hiding this comment.
This may be changed to systemctl restart cloudstack-agent || service cloudstack-agent restart
|
@rhtyd @nvazquez this one won't fwd-merge again due to moved files. I'll be looking at it later. |
* 4.11: KVM: Fix agents dont reconnect post maintenance (#3239)
* 4.12: KVM: Fix agents dont reconnect post maintenance (#3239)
|
done. simpler than thought |
Description
Before this fix, there were two possible scenarios when cancelling maintenance/prepare for maintenance on a KVM host:
After cancelling maintenance on one-time SSH password hosts, hosts did not reconnect and were not operational unless a manual restart on the CloudStack agent service was performed.
This feature keeps the connection between management server and host agent alive while preparing for maintenance and when on maintenance. This imples that:
Summary
Types of changes
Screenshots (if appropriate):
How Has This Been Tested?
Tested on 2xKVM hosts environment, NFS primary and secondary storage, changing values of the global setting 'kvm.ssh.to.agent' for each case to test